BANFF: An R Package for BAyesian Network Feature Finder
Abstract
Feature selection on high-dimensional networks plays an important role in understanding biological mechanisms and disease pathologies, and it has a broad range of applications. Recently, a Bayesian nonparametric mixture model (Zhao, Kang, and Yu 2014) has been successfully applied to selecting genes and gene sub-networks. We extend this method to a unified approach for feature selection on general high-dimensional networks, and we develop a powerful R package, the Bayesian network feature finder (BANFF), which provides a full suite of tools for posterior inference, model comparison, and graphical illustration of model fitting. In BANFF, we develop a parallel computing algorithm for Markov chain Monte Carlo (MCMC) based posterior inference and an Expectation-Maximization (EM) based algorithm for posterior approximation, both of which greatly reduce the computational time for model inference. In this work, we provide detailed instructions on how to use the R functions in BANFF, along with several tutorial examples on the analysis of simulated and real datasets. In particular, we demonstrate the use of BANFF for selecting features from a protein-protein interaction network and for performing brain image segmentation.
Similar resources
Learning Bayesian Networks with the bnlearn Package
bnlearn is an R package (R Team 2009) which includes several algorithms for learning the structure of Bayesian networks with either discrete or continuous variables. Both constraint-based and score-based algorithms are implemented, and can use the functionality provided by the snow package (Tierney et al. 2008) to improve their performance via parallel computing. Several network scores and cond...
Diagnosis of early acute renal allograft rejection by evaluation of multiple histological features using a Bayesian belief network.
BACKGROUND AND AIMS The development of the Banff classification of renal transplant pathology has allowed the standardisation of approaches to transplant biopsy histology and reduced interobserver and interdepartmental variation. The usefulness of the Banff classification in the diagnosis of acute rejection has previously been tested by sending sections from 21 "difficult" biopsies to almost al...
BNArray: an R package for constructing gene regulatory networks from microarray data by using Bayesian network
BNArray is a systemized tool developed in R. It facilitates the construction of gene regulatory networks from DNA microarray data by using Bayesian network. Significant sub-modules of regulatory networks with high confidence are reconstructed by using our extended sub-network mining algorithm of directed graphs. BNArray can handle microarray datasets with missing data. To evaluate th...
Ensemble Classification and Extended Feature Selection for Credit Card Fraud Detection
Due to the rise of technology, the possibility of fraud in different areas such as banking has increased. Credit card fraud is a crucial problem in banking and its danger is ever increasing. This paper proposes an advanced data mining method, considering both feature selection and decision cost for accuracy enhancement of credit card fraud detection. After selecting the best and most effec...
Bayesian Sample Size Determination for Joint Modeling of Longitudinal Measurements and Survival Data
A longitudinal study refers to the collection of a response variable and possibly some explanatory variables at multiple follow-up times. In many clinical studies with longitudinal measurements, the response variable for each patient is collected as long as an event of interest, which is considered the clinical end point, occurs. Joint modeling of continuous longitudinal measurements and survival time...